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SEQUENCE LISTING 



(1) GENERAL INFORMATION 



(i) APPLICANT: JOYCE, JAMES G. 



GEORGE, HUGH A. 

HOFMANN, KATHRYN J. 
JANSEN, KATHRIN U. 
NEEPER, MICHAEL P. 



(ii) TITLE OF THE INVENTION: RECOMBINANT PAPILLOMAVIRUS TYPE 18 



(iii) NUMBER OF SEQUENCES: 16 

( iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: CHRISTINE E. CARTY - MERCK & CO . , INC. 

(B) STREET: 126 EAST LINCOLN AVENUE - P.O. BOX 2000 

(C) CITY: RAHWAY 

(D) STATE: NJ 

(E) COUNTRY: US 

(F) ZIP: 07065-0907 

(v) COMPUTER READABLE FORM: 
(A) ..MEDIUM. TYPE.: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 1.5 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/409,122 

(B) FILING DATE: 22-MAR-1995 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: CARTY, CHRISTINE E 

(B) REGISTRATION NUMBER: 36,099 

(C) REFERENCE /DOCKET NUMBER: 19425 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 908-594-6734 

(B) TELEFAX: 908-594-4720 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 
(i) SEQUENCE CHARACTERISTICS: 



VACCIUNE 
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(A) LENGTH: 1524 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 
<vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



ATGGC TTTGT 


CCCCCCC 1 AC 


mp A P A A rp A O O 

IXjAC AA I ACC 


0 m a m a 0 0 mmo 
CI Al ACC 11C 


0 a 0 0 mo 0 mmo 
CACC ICC 11C 


mo TV -1 OO A AO A 


D KJ 


pmnvm* A A m A 

G 1 I\j 1 AAA 1 A 


C 1\jA rc a 1 1 a 


rnprpp A prnppp 
1 Vs 1\jAC 1 CCC 


a 0 a a 00 a m a m 
AC AACC A 1 A 1 


mmm a mo a mo o 
1 1 1 AlCArcc 


rnppp A oompm 




A O A mm A mm A A 

AC A I I A 1 1 AA 


omo rnrnr , r , T' A A 

C 1 C 1 1 CC 1 AA 


moo a m a mm mm 
1CCA1A1 111 


a ooommoomo 
ACCC 1 1 CC 1 \j 


O A O O T'O O HPO O 
C ACvj 1\jC 1\jvj 




1 80 

J. O V 


CA1 Al 1 CC 1 A 


A OPmTWPPmPP 

ACC 1 1 1 C IXjtL. 


a m a 00 a a m a m 
Ai ACCAA1 Al 


a 0 a om a mmmo 
AC AC 1 A 1 1 1 C 


p p p tv p "h. p f P t v 


a pomo A OOp A 




7\ A m A A A rnrprno 

AA1 AAA1 1 i\j 


pfpmm A porno A 
C 1 1 1 ACL 1\jA 


m a a m a pm a mm 
1AA1AC1A1 1 


m a m a a moo mo 
1 Al AA1CC 1 Cj 


A A AP A A OO 
AAACACAACC 


mmm ao r PO r PPP 
111 AL* l\j 1\jvj 


^ n n 


o o o nv 1 mo o mo 
CCC 1\j 1C»C I\j 


/-» 7. O mOO A A A rp 

CAC rCC AAA 1 


1\jCC CC 1\jC 1 


O 7\OOOmT»"n a O 

CACCC 1 1 1 AC 


omommoooom 
C 1\j 1 IvjCCC 1 


m a 0 mo 000 a m 

1 Avj 1 (jVjL>^H I 


^ 0 

J D U 


OO 71 rpmrnrp 71 rp 7, 

CCA1 J. 1 1 Al A 


A T A A A mm 

A1AAA1 I AC A 


mo a 0 a pmp A A 
1 CAC AC 1 CAA 


a ommooo a mo 
AC1 1CCCA1C 


0000mA 00 mo 
CCCC 1 ACC 1 C 


m A A rnprpmmpm 
1 AA lb 1 1 1L1 




CACC ACC 1 I a 


r~*(T~* a 0 a a rrv^m 
CCCAC AA IXj 1 


O rnp rpo m a 0 A rp 

C 1 C l\j 1 ACA 1 


rp A m A A O O A O A 

1 A 1 AACC AC A 


0 a 0 a 0 mm a mo 

cacaci lArc 


m a mmmmpooo 

1 A 1 1 1 ICiCrCC 


*i O (J 


rpr 1 tip poop mo 


pm^ rpTtPPOP A 
C 1 A I ltiCCCA 


a 0 a omoooom 
ACAC IXiCrCC 1 


AAA OOO A omO 

AAACCCAC i\st 


ommom a a at>p 
C 1 1C 1 AAA1C 


pppmppmmm a 
IjHj 1 1 1 1 A 




1 CAC ACCCCC 


A mmo r* r* f*> r* 
Al 1 CCCCCCC 


mmm a 0 a a 0 mm 
111 ACAAC 1 1 


A A O A A O A O A O 

AACAACACAC 


mmmmoo a a o a 
111 1\jCAACA 


TV^PTT" ATA mp 


DvU 


/-"'mAOAmAO mo 

C I AC A I AC I C 


p A m A fTV^OfTI^/^ 

CA1A1\j»C1CC 


0 a moo a ommm 
C A IXjCAC 111 


a 0 m a 0 a mmo 0 
AC 1 ACA 1 1\jC 


A AOAmA omA A 

AAC A 1 AC 1 AA 


a mo mo a 00m a 
A 1 vi 1 CACC 1 A 




CCATTGGATA 


TTTGTCAGTC 


TATTTGTAAA 


TATCCTGATT 


ATTTACAAAT 


GTCTGCAGAT 


720 


CCTTATGGGG 


ATTCCATGTT 


TTTTTGCTTA 


CGACGTGAGC 


AGCTTTTTGC 


TAGGCATTTT 


780 


TGGAATAGGG 


CAGGTACTAT 


GGGTGACACT 


GTGCCTCAAT 


CCTTATATAT 


TAAAGGCACA 


840 


GGTATGCGTG 


CTTCACCTGG 


CAGCTGTGTG 


TATTCTCCCT 


CTCCAAGTGG 


CTCTATTGTT 


900 


ACCTCTGACT 


CCCAGTTGTT 


TAATAAACCA 


TATTGGTTAC 


ATAAGGCACA 


GGGTCATAAC 


960 


AATGGTATCT 


GCTGGCATAA 


TCAATTATTT 


GTTACTGTGG 


TAGATACCAC 


TCGTAGTACC 


1020 


AATTTAACAA 


TATGTGCTTC 


TACACAGTCT 


CCTGTACCTG 


GGCAATATGA 


TGCTACCAAA 


1080 


TTTAAGCAGT 


ATAGCAGACA 


TGTTGAAGAA 


TATGATTTGC 


AGTTTATTTT 


TCAGTTATGT 


1140 


ACT ATT AC TT 


TAACTGCAGA 


TGTTATGTCC 


TATATTCATA 


GTATGAATAG 


CAGTATTTTA 


1200 


GAGGATTGGA 


ACTTTGGTGT 


TCCCCCCCCG 


CCAACTACTA 


GTTTGGTGGA 


TACATATCGT 


1260 


TTTGTACAAT 


CTGTTGCTAT 


TACCTGTCAA 


AAGGATGCTG 


CACCAGCTGA 


AAATAAGGAT 


1320 


CCCTATGATA 


AGTTAAAGTT 


TTGGAATGTG 


GATTTAAAGG 


AAAAGTTTTC 


TTTGGACTTA 


1380 


GATCAATATC 


CCCTTGGACG 


TAAATTTTTG 


GTTCAGGCTG 


GATTGCGTCG 


CAAGCCCACC 


1440 


ATAGGCCCTC 


GTAAACGTTC 


TGCTCCATCT 


GCCACTACGT 


CTTCTAAACC 


TGCCAAGCGT 


1500 


GTGCGTGTAC 


GTGCCAGGAA 


GTAA 








1524 
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(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Ala Leu Trp Arg Pro Ser Asp Asn Thr Val Tyr Leu Pro Pro Pro 

15 10 15 

Ser Val Ala Arg Val Val Asn Thr Asp Asp Tyr Val Thr Arg Thr Ser 

20 25 30 

lie Phe Tyr His Ala Gly Ser Ser Arg Leu Leu Thr Val Gly Asn Pro 

35 40 45 

Tyr" Phe' *Arg Val" Pro" Ala "Gly "Gly "Gly' Asn "Lys *~Glh~ Aspf'tle Pro Lys 

50 55 60 

Val Ser Ala Tyr Gin Tyr Arg Val Phe Arg Val Gin Leu Pro Asp Pro 
65 70 75 80 

Asn Lys Phe Gly Leu Pro Asp Asn Ser He Tyr Asn Pro Glu Thr Gin 

85 90 95 

Arg Leu Val Trp Ala Cys Ala Gly Val Glu He Gly Arg Gly Gin Pro 

100 105 110 

Leu Gly Val Gly Leu Ser Gly His Pro Phe Tyr Asn Lys Leu Asp Asp 

115 120 125 

Thr Glu Ser Ser His Ala Ala Thr Ser Asn Val Ser Glu Asp Val Arg 

130 135 140 

Asp Asn Val Ser Val Asp Tyr Lys Gin Thr Gin Leu Cys He Leu Gly 
145 150 155 160 

Cys Ala Pro Ala He Gly Glu His Trp Ala Lys Gly Thr Ala Cys Lys 

165 170 175 

Ser Arg Pro Leu Ser Gin Gly Asp Cys Pro Pro Leu Glu Leu Lys Asn 

180 185 190 

Thr Val Leu Glu Asp Gly Asp Met Val Asp Thr Gly Tyr Gly Ala Met 
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195 200 205 

Asp Phe Ser Thr Leu Gin Asp Thr Lys Cys Glu Val Pro Leu Asp lie 

210 215 220 

Cys Gin Ser lie Cys Lys Tyr Pro Asp Tyr Leu Gin Met Ser Ala Asp 
225 230 235 240 

Pro Tyr Gly Asp Ser Met Phe Phe Cys Leu Arg Arg Glu Gin Leu Phe 

245 250 255 

Ala Arg His Phe Trp Asn Arg Ala Gly Thr Met Gly Asp Thr Val Pro 

260 265 270 

Gin Ser Leu Tyr lie Lys Gly Thr Gly Met Arg Ala Ser Pro Gly Ser 

275 280 285 

Cys Val Tyr Ser Pro Ser Pro Ser Gly Ser lie Val Thr Ser Asp Ser 

290 295 300 

Gin Leu Phe Asn Lys Pro Tyr Trp Leu His Lys Ala Gin Gly His Asn 
305 310 315 320 

Asn Gly lie Cys Trp His Asn Gin Leu Phe Val Thr Val Val Asp Thr 

- • - '-325 • ■ -330 -335 

Thr Arg Ser Thr Asn Leu Thr lie Cys Ala Ser Thr Gin Ser Pro Val 

340 345 350 

Pro Gly Gin Tyr Asp Ala Thr Lys Phe Lys Gin Tyr Ser Arg His Val 

355 360 365 

Glu Glu Tyr Asp Leu Gin Phe lie Phe Gin Leu Cys Thr lie Thr Leu 

370 375 380 

Thr Ala Asp Val Met Ser Tyr lie His Ser Met Asn Ser Ser lie Leu 
385 390 395 400 

Glu Asp Trp Asn Phe Gly Val Pro Pro Pro Pro Thr Thr Ser Leu Val 

405 410 415 

Asp Thr Tyr Arg Phe Val Gin Ser Val Ala lie Thr Cys Gin Lys Asp 

420 425 430 

Ala Ala Pro Ala Glu Asn Lys Asp Pro Tyr Asp Lys Leu Lys Phe Trp 

435 440 445 

Asn Val Asp Leu Lys Glu Lys Phe Ser Leu Asp Leu Asp Gin Tyr Pro 

450 455 460 

Leu Gly Arg Lys Phe Leu Val Gin Ala Gly Leu Arg Arg Lys Pro Thr 
465 470 475 480 



5% 




# 
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lie Gly Pro Arg Lys Arg Ser Ala Pro Ser Ala Thr Thr Ser Ser Lys 



485 



490 



495 



Pro Ala Lys Arg Val Arg Val Arg Ala Arg Lys 



500 



505 



(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1389 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 



ATGGTATCCC 


ACCGTGCCGC 


ACGACGCAAA 


CGGGCTTCGG 


TGACTGACTT 


ATATAAAACA 


60 


TGTAAACAAT 


CTGGTACATG 


TCCATCTGAT 


GTTGTTAATA 


AGGTAGAGGG 


CACCACGTTA 


120 


GCAGATAAAA 


TATTGCAATG 


GTCAAGCCTT 


GGTATATTTT 


TGGGTGGACT 


TGGCATAGGT 


180 


ACTGGAAGTG 


GTACAGGGGG 


TCGTACAGGG 


TACATTCCAT 


TGGGTGGGCG 


TTCCAATACA 


240 


GTTGTGGATG 


TCGGTCCTAC 


ACGTCCTCCA 


GTGGTTATTG 


AACCTGTGGG 


CCCCACAGAC 


300 


CCATCTATTG 


TTACATTAAT 


AGAGGACTCA 


AGTGTTGTTA 


CATCAGGTGC 


ACCTAGGCCT 


360 


ACTTTTACTG 


GCACGTCTGG 


GTTTGATATA 


ACATCTGCTG 


GTACAACTAC 


ACCTGCAGTT 


420 


TTGGATATCA 


CACCTTCGTC 


TACCTCTGTT 


TCTATTTCCA 


CAACCAATTT 


TACCAATCCT 


480 


GCATTTTCTG 


ATCCGTCCAT 


TATTGAAGTT 


CCACAAACTG 


GGGAGGTGTC 


AGGTAATGTA 


540 


TTTGTTGGTA 


CCCCTACATC 


TGGAACACAT 


GGGTATGAAG 


AAATACCTTT 


ACAAACATTT 


600 


GCTTCTTCTG 


GTACGGGGGA 


GGAACCCATT 


AGTAGTACCC 


CATTGCCTAC 


TGTGCGGCGT 


660 


GTAGCAGGTC 


CCCGCCTTTA 


CAGTAGGGCC 


TACCAACAAG 


TGTCTGTGGC 


TAACCCTGAG 


720 


TTTCTTACAC 


GTCCATCCTC 


TTTAATTACC 


TATGACAACC 


CGGCCTTTGA 


GCCTGTGGAC 


780 


ACTACATTAA 


CATTTGAGCC 


TCGTAGTAAT 


GTTCCTGATT 


CAGATTTTAT 


GGATATTATC 


840 


CGTTTACATA 


GGCCTGCTTT 


AACATCCAGG 


CGTGGTACTG 


TGCGCTTTAG 


TAGATTAGGT 


900 


CAAAGGGCAA 


CTATGTTTAC 


CCGTAGCGGT 


ACACAAATAG 


GTGCTAGGGT 


TCACTTTTAT 


960 


CATGATATAA 


GTCCTATTGC 


ACCCTCCCCA 


GAATATATTG 


AACTGCAGCC 


TTTAGTATCT 


1020 


GCCACGGAGG 


ACAATGGCTT 


GTTTGATATA 


TATGCAGATG 


AC AT AG AC CC 


TGCAATGCCT 


1080 


GTACCATCGC 


GTCCTACTAC 


CTCCTCTGCA 


GTTTCTACAT 


ATTCGCCCAC 


TATATCATCT 


1140 


GCCTCTTCCT 


ATAGTAATGT 


AACGGTCCCT 


TTAACCTCCT 


CTTGGGATGT 


GCCTGTATAC 


1200 
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ACGGGTCCTG ATATTACATT ACCACCTACT ACCTCTGTAT GGCCCATTGT ATCACCCACA 1260 

GCCCCTGCCT CTACACAGTA TATTGGTATA CATGGTACAC ATTATTATTT GTGGCCATTA 1320 

TATTATTTTA TTCCTAAAAA GCGTAAACGT GTTCCCTATT TTTTTGCAGA TGGCTTTGTG 1380 

GCGGCCTAG 1389 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Val Ser His Arg Ala Ala Arg Arg Lys Arg Ala Ser Val Thr Asp 

15 10 15 

Leu Tyr Lys Thr Cys Lys Gin Ser Gly Thr Cys Pro Ser Asp Val Val 

20 25 30 

Asn Lys Val Glu Gly Thr Thr Leu Ala Asp Lys lie Leu Gin Trp Ser 

35 40 45 

Ser Leu Gly lie Phe Leu Gly Gly Leu Gly lie Gly Thr Gly Ser Gly 

50 55 60 

Thr Gly Gly Arg Thr Gly Tyr lie Pro Leu Gly Gly Arg Ser Asn Thr 
65 70 75 80 

Val Val Asp Val Gly Pro Thr Arg Pro Pro Val Val He Glu Pro Val 

85 90 95 

Gly Pro Thr Asp Pro Ser He Val Thr Leu He Glu Asp Ser Ser Val 

100 105 110 

Val Thr Ser Gly Ala Pro Arg Pro Thr Phe Thr Gly Thr Ser Gly Phe 

115 120 125 

Asp He Thr Ser Ala Gly Thr Thr Thr Pro Ala Val Leu Asp He Thr 

130 135 140 

Pro Ser Ser Thr Ser Val Ser He Ser Thr Thr Asn Phe Thr Asn Pro 
145 150 155 160 



Uo 



-40- 19425 

Ala Phe Ser Asp Pro Ser He He Glu Val Pro Gin Thr Gly Glu Val 

165 170 175 

Ser Gly Asn Val Phe Val Gly Thr Pro Thr Ser Gly Thr His Gly Tyr 

180 185 190 

Glu Glu lie Pro Leu Gin Thr Phe Ala Ser Ser Gly Thr Gly Glu Glu 

195 200 205 

Pro He Ser Ser Thr Pro Leu Pro Thr Val Arg Arg Val Ala Gly Pro 

210 215 220 

Arg Leu Tyr Ser Arg Ala Tyr Gin Gin Val Ser Val Ala Asn Pro Glu 
225 230 235 240 

Phe Leu Thr Arg Pro Ser Ser Leu He Thr Tyr Asp Asn Pro Ala Phe 

245 250 255 

Glu Pro Val Asp Thr Thr Leu Thr Phe Glu Pro Arg Ser Asn Val Pro 

260 265 270 

Asp Ser Asp Phe Met Asp He He Arg Leu His Arg Pro Ala Leu Thr 

275 280 285 

Ser Arg Arg Gly --Thr- Val* Arg -Phe -Ser- Arg Leu- Gly- Gin -Arg -Ala- Thr 

290 295 300 

Met Phe Thr Arg Ser Gly Thr Gin He Gly Ala Arg Val His Phe Tyr 
305 310 315 320 

His Asp He Ser Pro He Ala Pro Ser Pro Glu Tyr He Glu Leu Gin 

325 330 335 

Pro Leu Val Ser Ala Thr Glu Asp Asn Gly Leu Phe Asp lie Tyr Ala 

340 345 350 

Asp Asp He Asp Pro Ala Met Pro Val Pro Ser Arg Pro Thr Thr Ser 

355 360 365 

Ser Ala Val Ser Thr Tyr Ser Pro Thr He Ser Ser Ala Ser Ser Tyr 

370 375 380 

Ser Asn Val Thr Val Pro Leu Thr Ser Ser Trp Asp Val Pro Val Tyr 
385 390 395 400 

Thr Gly Pro Asp He Thr Leu Pro Pro Thr Ser Val Trp Pro He Val. 

405 410 415 

Ser Pro Thr Ala Pro Ala Ser Thr Gin Tyr He Gly He His Gly Thr 
420 425 430 
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His Tyr Tyr Leu Trp Pro Leu Tyr Tyr Phe lie Pro Lys Lys Arg Lys 



435 



440 



445 



Arg Val Pro Tyr Phe Phe Ala Asp Gly Phe Val Ala Ala 



450 



455 



460 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GAAGATCTCA CAAAACAAAA TGGCTTTGTG GCGGCCTAGT G 41 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GAAGATCTTT ACTTCCTGGC ACGTACACGC ACACGC 3 6 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 





ft 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 



TCCCCCGGGC ACAAAACAAA ATGGTATCCC ACCGTGCCGC ACGAC 



45 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

TCCCCCGGGC TAGGCCGCCA CAAAGCCATC TGC 33 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 

-■-(B) TYPE:-- nucleic- acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CAATCCTTAT ATATTAAAGG CACAGGTATG 3 0 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 





t 
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CATCATATTG CCCAGGTACA GGAGACTGTG 



30 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 
GAAGATCTCA CAAAACAAAA TGGCTTTGTG GCGGCCTAGT G 41 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: -linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CCTAACGTCC TCAGAAACAT TAGAC 25 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
CTTAAAGCTT ATGTCACTTT CTCTTGTATC 3 0 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
TGATAAGCTT GCTCAATGGT TCTCTTCCTC 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

■ (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
TGGTCATCCC AAATCTTGAA A 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
CACCGTAGTG TTTGGAAGCG A 



